Multi-granularity for knowledge distillation
نویسندگان
چکیده
Considering the fact that students have different abilities to understand knowledge imparted by teachers, a multi-granularity distillation mechanism is proposed for transferring more understandable student networks. A self-analyzing module of teacher network designed, which enables learn from teaching patterns. Furthermore, stable excitation scheme robust supervision training. The can be embedded into frameworks, are taken as baselines. Experiments show improves accuracy 0.58% on average and 1.08% in best over baselines, makes its performance superior state-of-the-arts. It also exploited student's ability fine-tuning robustness noisy inputs improved via mechanism. code available at https://github.com/shaoeric/multi-granularity-distillation.
منابع مشابه
Granularity in Multi-Method
Multi-method planning is an approach to using a set of different planning methods to simultaneously achieve planner completeness, planning time efficiency, and plan length reduction. Although it has been shown that coordinating a set of methods in a coarse-grained, problem-by-problem manner has the potential for approaching this ideal, such an approach can waste a significant amount of time in ...
متن کاملSequence-Level Knowledge Distillation
Neural machine translation (NMT) offers a novel alternative formulation of translation that is potentially simpler than statistical approaches. However to reach competitive performance, NMT models need to be exceedingly large. In this paper we consider applying knowledge distillation approaches (Bucila et al., 2006; Hinton et al., 2015) that have proven successful for reducing the size of neura...
متن کاملKnowledge Distillation for Bilingual Dictionary Induction
Leveraging zero-shot learning to learn mapping functions between vector spaces of different languages is a promising approach to bilingual dictionary induction. However, methods using this approach have not yet achieved high accuracy on the task. In this paper, we propose a bridging approach, where our main contribution is a knowledge distillation training objective. As teachers, rich resource ...
متن کاملKnowledge Granularity and Action Selection
In this paper we introduce the concept of knowledge granularity and study its in uence on an agent's action selection process. Action selection is critical to an agent performing a task in a dynamic, unpredictable environment. Knowledge representation is central to the agent's action selection process. It is important to study what kind of knowledge the agent should represent and the preferred ...
متن کاملMulti-Granularity Noise for Curvilinear Grid LIC
A major problem of the existing curvilinear grid Line Integral Convolution (LIC) algorithm is that the resulting LIC textures may be distorted after being mapped onto the parametric surfaces, since a curvilinear grid usually consists of cells of di erent sizes. This paper proposes a way for solving the problem through using multi-granularity noise as the input image for LIC. A stochastic sampli...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Image and Vision Computing
سال: 2021
ISSN: ['0262-8856', '1872-8138']
DOI: https://doi.org/10.1016/j.imavis.2021.104286